Discourse Structure in Spoken Language: Studies on Speech Corpora

نویسندگان

  • Christine H. Nakatani
  • Julia Hirschberg
  • Barbara J. Grosz
چکیده

A better understanding of the intonational charaeteristics of spoken discourse may lead to new empirical techniques for identifying discourse structure from speech, as well as new algorithms for enhancing the naturalness of synthetic speech. This paper summarizes results of pilot studies that demonstrate reliable correlations of discourse and speech properties, and reports findings on a new corpus of direction-giving monologues, collected in both spontaneous and read speaking styles. Preliminary analyses of the direction-giving corpus show that the availability of speech significantly affects the reliability of discourse segmentation for a set of trained discourse labelers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

متن کامل

Discovering the Sounds of Discourse Structure Extended Abstract

It is widely accepted that discourses are composed of segments and that the recognition of segment boundaries is essential to a determination of discourse meaning (Grosz and Sidner, 1986). Written language has orthographic cues such as section headings, paragraph boundaries, and punctuation which can assist in identifying discourse structure. In spoken language, into-national variation provides...

متن کامل

Genre Analysis of ELT and Nursing Academic Written Discourse through Introduction

Since Swales’ (1981, 1990) CARS model work on the move structure of research articles, studies on genre analysis have been carried out amongst which works on different parts of research articles in various disciplines has gained a considerable literature. This study aims to investigate the rhetorical structure of the Introduction sections of articles in two fields of English Language Teaching (...

متن کامل

Cross-Domain and Cross-Language Porting of Shallow Parsing

English was the main focus of attention of the Natural Language Processing (NLP) community for years. As a result, there are significantly more annotated linguistic resources in English than in any other language. Consequently, data-driven tools for automatic text or speech processing are developed mainly for English. Developing similar corpora and tools for other languages is an important issu...

متن کامل

Grammars of Spoken English: New Outcomes of Corpus-Oriented Research

Recently work on the grammar of spoken English has advanced through the use of large, general, and varied corpora of the language, including corpora of spoken discourse. Here I review the research that has been emerging from the availability of such corpora, much of it emphasizing the need for new ways of conceptualizing spoken grammar, to replace the traditional reliance on grammatical models ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002